Feeds to Scour
SubscribedAll
Scoured 18240 posts in 139.8 ms
Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores
arxiv.org·1d
🏆LLM Benchmarking
Preview
Report Post
Mistaken correlations: Why it's critical to move beyond overly aggregated machine-learning metrics
techxplore.com·17h
🎯BM25
Preview
Report Post
From 75% to 99.6%: The Math of LLM Ensembles
shibaprasadb.com·1d·
Discuss: Hacker News
🏆LLM Benchmarking
Preview
Report Post
I Just Evaluated 200 Applications and I Have No Idea If I Did It Right
speakandregret.michaelinzlicht.com·18h
🏆LLM Benchmarking
Preview
Report Post
Methodology
pewresearch.org·17h
📊Benchmarking Methodology
Preview
Report Post
Retrieve and Rerank: Personalized Search Without Leaving Postgres
paradedb.com·1d
👤Search Personalization
Preview
Report Post
Uncovering Unfaithful CoT in Deceptive Models
lesswrong.com·6h
🛡️AI Security
Preview
Report Post
High Accuracy ICM Calculations for Large Fields - Blog
holdemresources.net·18h
🏆LLM Benchmarking
Preview
Report Post
Microseasons
kevinsdias.com·10h
🍄Mycorrhizal Networks
Preview
Report Post
Same URL in AI Overviews and blue links counts as one Google Search Console impression
searchengineland.com·15h
💫Search UX
Preview
Report Post
Targeted Bets
maxim.usindic.us·11h
🎰Bandit Algorithms
Preview
Report Post
Measuring Data Maturity
prepend.com·1d
👨‍💻Software development practices
Preview
Report Post
Reasoning or Fluency? Dissecting Probabilistic Confidence in Best-of-N Selection
arxiv.org·1d
🏆LLM Benchmarking
Preview
Report Post
ChatGPT’s Laws of Machine Learning
shruggingface.com·1d
🛡️AI Security
Preview
Report Post
Jacobson's Rank
denvaar.dev·1d
🌳Data Structures
Preview
Report Post
Learning from Models
rodney.bearblog.dev·1d
🔍AI Interpretability
Preview
Report Post
AI recruiters: faster, cheaper, and still clueless
pksunkara.com·6h·
Discuss: Hacker News
🧹Spam Filters
Preview
Report Post
LLMs Under Siege: The Red Team Reality Check of 2026
eddieoz.com·12h·
Discuss: Hacker News
🏆LLM Benchmarking
Preview
Report Post
Home
av-comparatives.org·1d
🤖Home Assistant
Preview
Report Post
Microsoft OpenAI docs leak 📄, X open sources algo 👨‍💻, MCP vs Skills vs Agents 🤖
tldr.tech·1d
🧠Obsidian
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help